On-line predictive linear regression

نویسندگان

  • Vladimir Vovk
  • Ilia Nouretdinov
  • Alex Gammerman
چکیده

We consider the on-line predictive version of the standard problem of linear regression; the goal is to predict each consecutive response given the corresponding explanatory variables and all the previous observations. The standard treatment of prediction in linear regression analysis has two drawbacks: (1) the usual prediction intervals guarantee that the probability of error is equal to the nominal significance level 2, but this property per se does not imply that the long-run frequency of error is close to 2; (2) it is not suitable for prediction of complex systems as it assumes that the number of observations exceeds the number of parameters. We state a recent result in machine learning showing that in the on-line protocol the frequency of error does equal the nominal significance level, up to statistical fluctuations, and we describe alternative regression models in which informative prediction intervals can be found before the number of observations exceeds the number of parameters. One of these models, which only assumes that the observations are independent and identically distributed, is popular in machine learning but greatly underused in the statistical theory of regression.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Eager Regression Method Based on Selecting Appropriate Features

This paper describes a machine learning method, called Regression by Selecthtg Best P~’ttllll’es (RSBF). RSBF consists of two phases: The first phase aims to find the predictive power of each feature by constructing simple linear regression lines, one per each continuous feature and number of categories pen each categorical feature. Although the predictive power of a continuous feature is const...

متن کامل

Adaptive Predictive Controllers Using a Growing and Pruning RBF Neural Network

An adaptive version of growing and pruning RBF neural network has been used to predict the system output and implement Linear Model-Based Predictive Controller (LMPC) and Non-linear Model-based Predictive Controller (NMPC) strategies. A radial-basis neural network with growing and pruning capabilities is introduced to carry out on-line model identification.An Unscented Kal...

متن کامل

An Eager Regression Method Based on Best Feature Projections

This paper describes a machine learning method, called Regression by Selecting Best Feature Projections (RSBFP). In the training phase, RSBFP projects the training data on each feature dimension and aims to find the predictive power of each feature attribute by constructing simple linear regression lines, one per each continuous feature and number of categories per each categorical feature. Bec...

متن کامل

A comparative QSAR study of aryl-substituted isobenzofuran-1(3H)-ones inhibitors

A comparative workflow, including linear and non-linear QSAR models, was carried out to evaluate the predictive accuracy of models and predict the inhibition activity of a series of aryl-substituted isobenzofuran-1(3H)-ones. The data set consisted of 34 compounds was classified into the training and test sets, randomly. Molecular descriptors were selected using the genetic algorithm (GA) as a f...

متن کامل

Predictive factors of glycosylated hemoglobin using additive regression model

Introduction: Diabetes is a chronic disease, non-epidemic disease that costs a lot of money in each year. One of the diagnostic criteria for diabetes is Glycosylated Hemoglobin (HBA1C), which in this study the effective factors on it examined by additive regression model. Materials and Methods: In this cross-sectional study, 130 patients with diabetes type-2 were selected based on simple random...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005